Overview

Dataset statistics

Number of variables19
Number of observations37113
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.4 MiB
Average record size in memory152.0 B

Variable types

NUM15
CAT4

Reproduction

Analysis started2021-05-20 09:34:33.437759
Analysis finished2021-05-20 09:35:06.552420
Duration33.11 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

done_at has a high cardinality: 37069 distinct values High cardinality
travel_start_time has a high cardinality: 37077 distinct values High cardinality
trip_id is highly correlated with idHigh correlation
id is highly correlated with trip_idHigh correlation
g_distance is highly correlated with durationHigh correlation
duration is highly correlated with g_distanceHigh correlation
done_at is uniformly distributed Uniform
travel_start_time is uniformly distributed Uniform
id has unique values Unique
trip_start has unique values Unique
duration has 1616 (4.4%) zeros Zeros
g_distance has 1608 (4.3%) zeros Zeros

Variables

id
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct count37113
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean835275.62748902
Minimum100274
Maximum2153861
Zeros0
Zeros (%)0.0%
Memory size289.9 KiB

Quantile statistics

Minimum100274
5-th percentile257747.2
Q1426952
median667723
Q31327766
95-th percentile1672961.2
Maximum2153861
Range2053587
Interquartile range (IQR)900814

Descriptive statistics

Standard deviation495616.6425
Coefficient of variation (CV)0.5933570024
Kurtosis-0.8782917444
Mean835275.6275
Median Absolute Deviation (MAD)302165
Skewness0.6401494799
Sum3.099958436e+10
Variance2.456358564e+11
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
4628461< 0.1%
 
21188141< 0.1%
 
3372301< 0.1%
 
5399811< 0.1%
 
15877601< 0.1%
 
3966191< 0.1%
 
1979621< 0.1%
 
4867271< 0.1%
 
4191421< 0.1%
 
2819231< 0.1%
 
Other values (37103)37103> 99.9%
 
ValueCountFrequency (%) 
1002741< 0.1%
 
1003271< 0.1%
 
1004401< 0.1%
 
1005481< 0.1%
 
1005621< 0.1%
 
ValueCountFrequency (%) 
21538611< 0.1%
 
21538191< 0.1%
 
21534291< 0.1%
 
21532271< 0.1%
 
21527741< 0.1%
 

start_lat
Real number (ℝ≥0)

Distinct count15
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean52.02249676871987
Minimum51.3614965
Maximum52.6002058
Zeros0
Zeros (%)0.0%
Memory size289.9 KiB

Quantile statistics

Minimum51.3614965
5-th percentile52.0201841
Q152.0201841
median52.0201841
Q352.02039
95-th percentile52.02039
Maximum52.6002058
Range1.2387093
Interquartile range (IQR)0.0002059

Descriptive statistics

Standard deviation0.03571243005
Coefficient of variation (CV)0.0006864805088
Kurtosis218.7775273
Mean52.02249677
Median Absolute Deviation (MAD)0
Skewness13.11702163
Sum1930710.923
Variance0.00127537766
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
52.02018412258460.9%
 
52.020391433638.6%
 
52.5380851150.3%
 
52.0204466210.1%
 
52.6000377190.1%
 
52.516774712< 0.1%
 
52.5380869< 0.1%
 
51.36149655< 0.1%
 
52.53775713< 0.1%
 
52.02012092< 0.1%
 
Other values (5)7< 0.1%
 
ValueCountFrequency (%) 
51.36149655< 0.1%
 
52.02012092< 0.1%
 
52.02018412258460.9%
 
52.02020351< 0.1%
 
52.020391433638.6%
 
ValueCountFrequency (%) 
52.60020581< 0.1%
 
52.60020572< 0.1%
 
52.6000377190.1%
 
52.59919392< 0.1%
 
52.53967871< 0.1%
 

start_lon
Real number (ℝ≥0)

Distinct count16
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.158654064265891
Minimum4.7450827
Maximum6.1740619
Zeros0
Zeros (%)0.0%
Memory size289.9 KiB

Quantile statistics

Minimum4.7450827
5-th percentile5.15492
Q15.15492
median5.1550517
Q35.1550517
95-th percentile5.1550517
Maximum6.1740619
Range1.4289792
Interquartile range (IQR)0.0001317

Descriptive statistics

Standard deviation0.06345695791
Coefficient of variation (CV)0.01230106867
Kurtosis239.1209558
Mean5.158654064
Median Absolute Deviation (MAD)0
Skewness15.09923149
Sum191453.1283
Variance0.004026785507
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
5.15505172258060.8%
 
5.154921433638.6%
 
6.1654091150.3%
 
5.1557132210.1%
 
4.7450827190.1%
 
6.083021912< 0.1%
 
6.16540759< 0.1%
 
6.14197775< 0.1%
 
5.15505164< 0.1%
 
6.16633693< 0.1%
 
Other values (6)9< 0.1%
 
ValueCountFrequency (%) 
4.7450827190.1%
 
4.7454612< 0.1%
 
4.74580262< 0.1%
 
4.74580271< 0.1%
 
5.15387342< 0.1%
 
ValueCountFrequency (%) 
6.17406191< 0.1%
 
6.16633693< 0.1%
 
6.1654091150.3%
 
6.16540759< 0.1%
 
6.14197775< 0.1%
 

stop_lat
Real number (ℝ≥0)

Distinct count14336
Unique (%)38.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean52.133312704987475
Minimum50.76751
Maximum53.41821
Zeros0
Zeros (%)0.0%
Memory size289.9 KiB

Quantile statistics

Minimum50.76751
5-th percentile51.41412
Q151.93012
median52.1237004
Q352.3626641
95-th percentile52.855926
Maximum53.41821
Range2.6507
Interquartile range (IQR)0.4325441

Descriptive statistics

Standard deviation0.4232640378
Coefficient of variation (CV)0.008118878619
Kurtosis1.281902811
Mean52.1333127
Median Absolute Deviation (MAD)0.2265396
Skewness-0.1950817722
Sum1934823.634
Variance0.1791524457
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
52.17082682280.6%
 
52.20943131940.5%
 
52.31495891880.5%
 
52.04529391840.5%
 
52.29473011810.5%
 
51.93907161800.5%
 
52.33252511770.5%
 
52.29399831740.5%
 
52.33426711610.4%
 
52.36537831540.4%
 
Other values (14326)3529295.1%
 
ValueCountFrequency (%) 
50.767511< 0.1%
 
50.768791< 0.1%
 
50.768831< 0.1%
 
50.769341< 0.1%
 
50.769363< 0.1%
 
ValueCountFrequency (%) 
53.418212< 0.1%
 
53.41791< 0.1%
 
53.417511< 0.1%
 
53.416871< 0.1%
 
53.416681< 0.1%
 

stop_lon
Real number (ℝ≥0)

Distinct count14598
Unique (%)39.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.139928943979198
Minimum3.3778
Maximum7.20996
Zeros0
Zeros (%)0.0%
Memory size289.9 KiB

Quantile statistics

Minimum3.3778
5-th percentile4.3005051
Q14.66312
median5.0171841
Q35.52257
95-th percentile6.493056
Maximum7.20996
Range3.83216
Interquartile range (IQR)0.85945

Descriptive statistics

Standard deviation0.6522013548
Coefficient of variation (CV)0.1268891772
Kurtosis0.1827111995
Mean5.139928944
Median Absolute Deviation (MAD)0.3921859
Skewness0.7634924307
Sum190758.1829
Variance0.4253666072
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
5.36956592250.6%
 
5.02092961940.5%
 
4.97770841880.5%
 
4.53761131840.5%
 
4.56312921810.5%
 
4.95573111810.5%
 
4.86060281770.5%
 
4.87326481740.5%
 
4.86696311610.4%
 
4.88485121530.4%
 
Other values (14588)3529595.1%
 
ValueCountFrequency (%) 
3.37784< 0.1%
 
3.387153< 0.1%
 
3.441921< 0.1%
 
3.44582< 0.1%
 
3.450391< 0.1%
 
ValueCountFrequency (%) 
7.209961< 0.1%
 
7.208541< 0.1%
 
7.200041< 0.1%
 
7.19371< 0.1%
 
7.18091< 0.1%
 

quantity
Real number (ℝ≥0)

Distinct count76
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.806132621992294
Minimum0
Maximum130
Zeros7
Zeros (%)< 0.1%
Memory size289.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q32
95-th percentile18
Maximum130
Range130
Interquartile range (IQR)1

Descriptive statistics

Standard deviation6.888915739
Coefficient of variation (CV)1.809951576
Kurtosis25.21769697
Mean3.806132622
Median Absolute Deviation (MAD)0
Skewness4.076295951
Sum141257
Variance47.45716006
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
12375964.0%
 
2416411.2%
 
314844.0%
 
49212.5%
 
57312.0%
 
65391.5%
 
74511.2%
 
94131.1%
 
84071.1%
 
103561.0%
 
Other values (66)388810.5%
 
ValueCountFrequency (%) 
07< 0.1%
 
12375964.0%
 
2416411.2%
 
314844.0%
 
49212.5%
 
ValueCountFrequency (%) 
1302< 0.1%
 
1021< 0.1%
 
1001< 0.1%
 
891< 0.1%
 
861< 0.1%
 

done_at
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count37069
Unique (%)99.9%
Missing0
Missing (%)0.0%
Memory size289.9 KiB
2019-05-22 20:43:48+00:00
 
6
2020-06-04 02:56:54+00:00
 
5
2020-01-25 02:29:33+00:00
 
2
2020-04-03 09:12:09+00:00
 
2
2019-08-14 00:34:55+00:00
 
2
Other values (37064)
37096
ValueCountFrequency (%) 
2019-05-22 20:43:48+00:006< 0.1%
 
2020-06-04 02:56:54+00:005< 0.1%
 
2020-01-25 02:29:33+00:002< 0.1%
 
2020-04-03 09:12:09+00:002< 0.1%
 
2019-08-14 00:34:55+00:002< 0.1%
 
2020-04-10 13:10:23+00:002< 0.1%
 
2019-11-14 09:31:00+00:002< 0.1%
 
2019-04-25 17:25:19+00:002< 0.1%
 
2019-09-19 18:07:29+00:002< 0.1%
 
2019-09-04 04:41:29+00:002< 0.1%
 
Other values (37059)3708699.9%
 

Length

Max length32
Median length25
Mean length25.81612912
Min length25

trip_start
Categorical

UNIQUE

Distinct count37113
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size289.9 KiB
2019-09-23 23:24:20.176111+00:00
 
1
2021-02-22 23:34:53.361000+00:00
 
1
2020-12-10 23:08:26.927000+00:00
 
1
2019-11-22 11:14:28.864315+00:00
 
1
2021-02-27 00:53:06.747000+00:00
 
1
Other values (37108)
37108
ValueCountFrequency (%) 
2019-09-23 23:24:20.176111+00:001< 0.1%
 
2021-02-22 23:34:53.361000+00:001< 0.1%
 
2020-12-10 23:08:26.927000+00:001< 0.1%
 
2019-11-22 11:14:28.864315+00:001< 0.1%
 
2021-02-27 00:53:06.747000+00:001< 0.1%
 
2020-02-28 03:26:24.093677+00:001< 0.1%
 
2020-01-15 06:26:33.668097+00:001< 0.1%
 
2020-07-29 05:16:49.823475+00:001< 0.1%
 
2020-05-04 23:15:04.023822+00:001< 0.1%
 
2019-11-20 03:03:34.829755+00:001< 0.1%
 
Other values (37103)37103> 99.9%
 

Length

Max length32
Median length32
Mean length31.99886832
Min length25

trip_id
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count5272
Unique (%)14.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74161.2301888826
Minimum24309
Maximum145760
Zeros0
Zeros (%)0.0%
Memory size289.9 KiB

Quantile statistics

Minimum24309
5-th percentile35552.2
Q149327
median68873
Q3100242
95-th percentile120019
Maximum145760
Range121451
Interquartile range (IQR)50915

Descriptive statistics

Standard deviation28610.67344
Coefficient of variation (CV)0.3857901679
Kurtosis-0.9655626489
Mean74161.23019
Median Absolute Deviation (MAD)22420
Skewness0.3897918243
Sum2752345736
Variance818570634.8
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
83030380.1%
 
78210260.1%
 
114876250.1%
 
111601240.1%
 
79707230.1%
 
98433230.1%
 
44256230.1%
 
78233230.1%
 
44790220.1%
 
56720220.1%
 
Other values (5262)3686499.3%
 
ValueCountFrequency (%) 
2430911< 0.1%
 
245073< 0.1%
 
245714< 0.1%
 
246101< 0.1%
 
246202< 0.1%
 
ValueCountFrequency (%) 
1457601< 0.1%
 
1457583< 0.1%
 
1457501< 0.1%
 
1457491< 0.1%
 
1457481< 0.1%
 

customer_id
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size289.9 KiB
866
24086
1
13027
ValueCountFrequency (%) 
8662408664.9%
 
11302735.1%
 

Length

Max length3
Median length3
Mean length2.297981839
Min length1

driver_id
Real number (ℝ≥0)

Distinct count279
Unique (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean886.2831352895212
Minimum26.0
Maximum16731.0
Zeros0
Zeros (%)0.0%
Memory size289.9 KiB

Quantile statistics

Minimum26
5-th percentile117
Q1427
median598
Q3794
95-th percentile1136
Maximum16731
Range16705
Interquartile range (IQR)367

Descriptive statistics

Standard deviation1676.932599
Coefficient of variation (CV)1.892095802
Kurtosis32.70876532
Mean886.2831353
Median Absolute Deviation (MAD)183
Skewness5.62633015
Sum32892626
Variance2812102.943
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
74411933.2%
 
43111533.1%
 
11711353.1%
 
7159122.5%
 
6218662.3%
 
2428102.2%
 
4507512.0%
 
7427472.0%
 
4157392.0%
 
5557141.9%
 
Other values (269)2809375.7%
 
ValueCountFrequency (%) 
26190.1%
 
322130.6%
 
441< 0.1%
 
491< 0.1%
 
703631.0%
 
ValueCountFrequency (%) 
167319< 0.1%
 
16557210.1%
 
164025< 0.1%
 
15347410.1%
 
147531< 0.1%
 

kilometers
Real number (ℝ≥0)

Distinct count713
Unique (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2091.339368954275
Minimum0.0
Maximum725974.0
Zeros95
Zeros (%)0.3%
Memory size289.9 KiB

Quantile statistics

Minimum0
5-th percentile137
Q1205
median262
Q3378
95-th percentile579.4
Maximum725974
Range725974
Interquartile range (IQR)173

Descriptive statistics

Standard deviation25196.20523
Coefficient of variation (CV)12.04787975
Kurtosis379.2525359
Mean2091.339369
Median Absolute Deviation (MAD)71
Skewness17.87822476
Sum77615878
Variance634848758.2
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1963270.9%
 
2292420.7%
 
2112380.6%
 
2472380.6%
 
2572310.6%
 
2032280.6%
 
2352250.6%
 
2182250.6%
 
2492230.6%
 
2522210.6%
 
Other values (703)3471593.5%
 
ValueCountFrequency (%) 
0950.3%
 
71< 0.1%
 
161< 0.1%
 
181< 0.1%
 
1917< 0.1%
 
ValueCountFrequency (%) 
7259743< 0.1%
 
68543611< 0.1%
 
57303612< 0.1%
 
5578281< 0.1%
 
5317622< 0.1%
 

stop_start_lat
Real number (ℝ≥0)

Distinct count13222
Unique (%)35.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean52.121142956266006
Minimum50.76751
Maximum53.4366429
Zeros0
Zeros (%)0.0%
Memory size289.9 KiB

Quantile statistics

Minimum50.76751
5-th percentile51.45431
Q151.97508
median52.07474
Q352.34659
95-th percentile52.791646
Maximum53.4366429
Range2.6691329
Interquartile range (IQR)0.37151

Descriptive statistics

Standard deviation0.3996417328
Coefficient of variation (CV)0.007667555048
Kurtosis1.910975717
Mean52.12114296
Median Absolute Deviation (MAD)0.1884001
Skewness-0.1003403981
Sum1934371.979
Variance0.1597135146
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
52.020184134189.2%
 
52.0203917994.8%
 
52.17082682010.5%
 
52.31495891820.5%
 
52.33252511670.4%
 
51.93907161670.4%
 
52.36537831510.4%
 
52.36266411500.4%
 
52.40173891490.4%
 
52.352041480.4%
 
Other values (13212)3058182.4%
 
ValueCountFrequency (%) 
50.767511< 0.1%
 
50.768791< 0.1%
 
50.768831< 0.1%
 
50.769341< 0.1%
 
50.769363< 0.1%
 
ValueCountFrequency (%) 
53.43664291< 0.1%
 
53.418212< 0.1%
 
53.41791< 0.1%
 
53.417511< 0.1%
 
53.416871< 0.1%
 

stop_start_lon
Real number (ℝ≥0)

Distinct count13437
Unique (%)36.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.142898002516638
Minimum3.3778
Maximum7.20996
Zeros0
Zeros (%)0.0%
Memory size289.9 KiB

Quantile statistics

Minimum3.3778
5-th percentile4.3063796
Q14.71321
median5.1369355
Q35.4092616
95-th percentile6.4305
Maximum7.20996
Range3.83216
Interquartile range (IQR)0.6960516

Descriptive statistics

Standard deviation0.6117179154
Coefficient of variation (CV)0.1189442052
Kurtosis0.6310828731
Mean5.142898003
Median Absolute Deviation (MAD)0.3607055
Skewness0.8160519563
Sum190868.3736
Variance0.374198808
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
5.155051734149.2%
 
5.1549217994.8%
 
5.36956591990.5%
 
4.97770841820.5%
 
4.56312921680.5%
 
4.86060281670.4%
 
4.92749871500.4%
 
4.88485121500.4%
 
4.94384561490.4%
 
4.86496991480.4%
 
Other values (13427)3058782.4%
 
ValueCountFrequency (%) 
3.37784< 0.1%
 
3.387153< 0.1%
 
3.441921< 0.1%
 
3.44582< 0.1%
 
3.450391< 0.1%
 
ValueCountFrequency (%) 
7.209961< 0.1%
 
7.208541< 0.1%
 
7.200041< 0.1%
 
7.19371< 0.1%
 
7.18091< 0.1%
 

travel_start_time
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count37077
Unique (%)99.9%
Missing0
Missing (%)0.0%
Memory size289.9 KiB
2019-05-22 20:43:48+00:00
 
6
2020-06-04 02:56:54+00:00
 
4
2019-10-02 09:57:38+00:00
 
2
2019-09-19 18:07:29+00:00
 
2
2019-06-19 14:02:56+00:00
 
2
Other values (37072)
37097
ValueCountFrequency (%) 
2019-05-22 20:43:48+00:006< 0.1%
 
2020-06-04 02:56:54+00:004< 0.1%
 
2019-10-02 09:57:38+00:002< 0.1%
 
2019-09-19 18:07:29+00:002< 0.1%
 
2019-06-19 14:02:56+00:002< 0.1%
 
2020-04-10 13:10:23+00:002< 0.1%
 
2020-07-31 17:41:19+00:002< 0.1%
 
2020-04-11 13:21:42+00:002< 0.1%
 
2019-08-14 00:34:55+00:002< 0.1%
 
2020-10-14 13:16:31+00:002< 0.1%
 
Other values (37067)3708799.9%
 

Length

Max length32
Median length25
Mean length26.6547032
Min length25

duration
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count4263
Unique (%)11.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.706149237913042
Minimum0.0
Maximum259.71666666666664
Zeros1616
Zeros (%)4.4%
Memory size289.9 KiB

Quantile statistics

Minimum0
5-th percentile0.6433333333
Q19.1
median16
Q328
95-th percentile60.26666667
Maximum259.7166667
Range259.7166667
Interquartile range (IQR)18.9

Descriptive statistics

Standard deviation20.09958721
Coefficient of variation (CV)0.9259858572
Kurtosis9.385494456
Mean21.70614924
Median Absolute Deviation (MAD)8.6
Skewness2.412925331
Sum805580.3167
Variance403.9934059
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
016164.4%
 
2.8166666671790.5%
 
12.266666671570.4%
 
13.71460.4%
 
3.3833333331450.4%
 
8.91270.3%
 
37.566666671100.3%
 
14.283333331100.3%
 
14.116666671090.3%
 
18.916666671010.3%
 
Other values (4253)3431392.5%
 
ValueCountFrequency (%) 
016164.4%
 
0.01666666667530.1%
 
0.033333333334< 0.1%
 
0.057< 0.1%
 
0.066666666672< 0.1%
 
ValueCountFrequency (%) 
259.71666671< 0.1%
 
249.13333331< 0.1%
 
174.18333331< 0.1%
 
169.46666671< 0.1%
 
167.61666671< 0.1%
 

g_distance
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count18751
Unique (%)50.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.743591032791745
Minimum0.0
Maximum255.752
Zeros1608
Zeros (%)4.3%
Memory size289.9 KiB

Quantile statistics

Minimum0
5-th percentile0.1
Q14.087
median11.751
Q328.744
95-th percentile81.0824
Maximum255.752
Range255.752
Interquartile range (IQR)24.657

Descriptive statistics

Standard deviation30.18080574
Coefficient of variation (CV)1.327002658
Kurtosis11.42007324
Mean22.74359103
Median Absolute Deviation (MAD)9.226
Skewness2.863091201
Sum844082.894
Variance910.8810353
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
016084.3%
 
0.9881370.4%
 
9.4391320.4%
 
8.8521250.3%
 
0.961230.3%
 
3.9911110.3%
 
52.5931050.3%
 
9.687950.3%
 
24.176940.3%
 
8.58930.3%
 
Other values (18741)3449092.9%
 
ValueCountFrequency (%) 
016084.3%
 
0.0013< 0.1%
 
0.0022< 0.1%
 
0.0036< 0.1%
 
0.004480.1%
 
ValueCountFrequency (%) 
255.7521< 0.1%
 
255.1661< 0.1%
 
254.2321< 0.1%
 
253.7451< 0.1%
 
251.3761< 0.1%
 

turns
Real number (ℝ≥0)

Distinct count40
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.921914154070002
Minimum1
Maximum43
Zeros0
Zeros (%)0.0%
Memory size289.9 KiB

Quantile statistics

Minimum1
5-th percentile2
Q19
median13
Q317
95-th percentile22
Maximum43
Range42
Interquartile range (IQR)8

Descriptive statistics

Standard deviation5.627195524
Coefficient of variation (CV)0.4354769314
Kurtosis0.1065620855
Mean12.92191415
Median Absolute Deviation (MAD)4
Skewness-0.007696517568
Sum479571
Variance31.66532947
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1230818.3%
 
1428357.6%
 
1326697.2%
 
1625556.9%
 
1524676.6%
 
1723186.2%
 
1122756.1%
 
1021655.8%
 
918334.9%
 
117484.7%
 
Other values (30)1316735.5%
 
ValueCountFrequency (%) 
117484.7%
 
22180.6%
 
33380.9%
 
45781.6%
 
58162.2%
 
ValueCountFrequency (%) 
431< 0.1%
 
421< 0.1%
 
401< 0.1%
 
382< 0.1%
 
363< 0.1%
 

total_distance
Real number (ℝ≥0)

Distinct count3923
Unique (%)10.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean202.93789413413094
Minimum3.54
Maximum718.327
Zeros0
Zeros (%)0.0%
Memory size289.9 KiB

Quantile statistics

Minimum3.54
5-th percentile72.357
Q1130.13
median181.698
Q3259.846
95-th percentile394.22
Maximum718.327
Range714.787
Interquartile range (IQR)129.716

Descriptive statistics

Standard deviation105.6803466
Coefficient of variation (CV)0.5207521594
Kurtosis2.710364341
Mean202.9378941
Median Absolute Deviation (MAD)59.744
Skewness1.273749001
Sum7531634.065
Variance11168.33566
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
91.5791320.4%
 
72.3571300.4%
 
123.3921290.3%
 
276.21110.3%
 
77.341960.3%
 
142.404900.2%
 
60.076870.2%
 
104.955830.2%
 
205.5800.2%
 
160.846770.2%
 
Other values (3913)3609897.3%
 
ValueCountFrequency (%) 
3.541< 0.1%
 
8.3291< 0.1%
 
9.1431< 0.1%
 
9.363190.1%
 
9.3721< 0.1%
 
ValueCountFrequency (%) 
718.3277< 0.1%
 
716.28115< 0.1%
 
700.0964< 0.1%
 
697.75813< 0.1%
 
686.56615< 0.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

Sample

First rows

idstart_latstart_lonstop_latstop_lonquantitydone_attrip_starttrip_idcustomer_iddriver_idkilometersstop_start_latstop_start_lontravel_start_timedurationg_distanceturnstotal_distance
010027452.020395.1549251.9832275.079331192019-01-22 01:43:49+00:002019-01-22 00:48:00.239601+00:00243091533.0418.052.0203905.1549202019-01-22 01:01:09.033766+00:0014.98333311.58216193.992
110058252.020395.1549251.8386404.957680112019-01-22 02:11:04+00:002019-01-22 01:01:09.033766+00:00243091533.0418.051.9832275.0793312019-01-22 01:43:49+00:0018.91666721.37812193.992
210032752.020395.1549251.9798984.58396522019-01-22 03:01:26+00:002019-01-22 00:47:59.436893+00:00243091533.0418.051.8386404.9576802019-01-22 02:11:04+00:0038.03333345.40617193.992
310061152.020395.1549251.9609954.543000252019-01-22 03:22:42+00:002019-01-22 00:47:58.148863+00:00243091533.0418.051.9798984.5839652019-01-22 03:01:26+00:0011.5000005.35211193.992
410062952.020395.1549251.9657784.22133122019-01-22 03:52:17+00:002019-01-22 00:50:42.050646+00:00243091533.0418.051.9609954.5430002019-01-22 03:22:42+00:0027.28333329.54014193.992
510054852.020395.1549251.9203524.45159582019-01-22 04:07:54+00:002019-01-22 00:47:56.887189+00:00243091533.0418.051.9657784.2213312019-01-22 03:52:17+00:0019.75000021.59717193.992
610079052.020395.1549251.9099474.44819622019-01-22 04:21:33+00:002019-01-22 00:47:55.106156+00:00243091533.0418.051.9203524.4515952019-01-22 04:07:54+00:005.1500001.80510193.992
710059352.020395.1549251.8917004.566160192019-01-22 04:43:54+00:002019-01-22 00:47:56.389478+00:00243091533.0418.051.9099474.4481962019-01-22 04:21:33+00:0026.63333324.35314193.992
810067852.020395.1549251.8335394.64418812019-01-22 05:05:40+00:002019-01-22 00:50:42.150849+00:00243091533.0418.051.8917004.5661602019-01-22 04:43:54+00:0016.25000012.34720193.992
910056252.020395.1549251.8378974.635991162019-01-22 05:19:44+00:002019-01-22 00:47:53.399479+00:00243091533.0418.051.8335394.6441882019-01-22 05:05:40+00:003.9000001.95410193.992

Last rows

idstart_latstart_lonstop_latstop_lonquantitydone_attrip_starttrip_idcustomer_iddriver_idkilometersstop_start_latstop_start_lontravel_start_timedurationg_distanceturnstotal_distance
37103214350152.0201845.15505252.3858034.84102952021-04-13 10:04:18.296000+00:002021-04-13 04:40:29.650000+00:00145589170.0212.052.3325254.8606032021-04-13 07:25:33.373000+00:0011.0166678.5771258.245
37104214365152.0201845.15505253.2197806.57938012021-04-13 11:58:55+00:002021-04-13 05:33:11.305000+00:00145579114753.0705.052.0201845.1550522021-04-13 05:33:11.305000+00:00122.733333194.52219194.522
37105215277452.0201845.15505252.0571654.49328312021-04-14 00:43:01.482000+00:002021-04-13 23:13:24.520000+00:00145758115347.0152.052.0201845.1550522021-04-13 23:14:52.011000+00:0042.81666756.7812180.855
37106215322752.0201845.15505252.0186044.434019202021-04-14 00:50:28.012000+00:002021-04-13 23:13:47.202000+00:00145758115347.0152.052.0571654.4932832021-04-14 00:43:01.482000+00:0019.20000012.6792280.855
37107215266052.0201845.15505251.8386404.957680112021-04-14 01:07:19.989000+00:002021-04-13 23:39:10.383000+00:00145760115347.064.052.0201845.1550522021-04-13 23:39:10.383000+00:0025.73333332.0061232.006
37108215164752.0201845.15505252.0004564.331418222021-04-14 01:46:04.376000+00:002021-04-13 23:14:52.011000+00:00145758115347.0152.052.0186044.4340192021-04-14 00:50:28.012000+00:0017.68333311.3951880.855
37109214367152.0201845.15505251.5819504.79758012021-04-14 07:41:02.802000+00:002021-04-14 02:54:04.074000+00:001457451415.0284.052.0201845.1550522021-04-14 02:54:04.074000+00:0045.81666763.4101463.410
37110215381952.0201845.15505252.3858034.84102912021-04-14 07:43:09+00:002021-04-14 04:15:53.097000+00:001457481433.0264.052.0201845.1550522021-04-14 04:15:53.097000+00:0039.90000055.1211655.121
37111215386152.0201845.15505252.3444305.61369012021-04-14 07:50:12.703000+00:002021-04-14 05:05:27.413000+00:001457491713.0304.052.0201845.1550522021-04-14 05:05:27.413000+00:0042.08333358.2081458.208
37112215342952.0201845.15505252.0452944.53761192021-04-14 08:04:51.516000+00:002021-04-14 03:14:25.809000+00:001457501215.0143.052.0201845.1550522021-04-14 03:14:25.809000+00:0037.56666752.5931352.593